Repseek, a tool to retrieve approximate repeats from large DNA sequences

نویسندگان
چکیده

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Repseek, a tool to retrieve approximate from large DNA sequences

The importance of genome redundancy has been strongly emphasized in the field of genome dynamics and evolution as well as in medical biology. A repeat is a sequence present twice or more with a high degree of similarity within a larger sequence (e.g. a chromosome) or set of sequences (e.g. a genome with several chromosomes). Each instance of the repeated sub-sequence is called a ’copy’ of the r...

متن کامل

Repseek, a tool to retrieve approximate repeats from large DNA sequences

UNLABELLED Chromosomes or other long DNA sequences contain many highly similar repeated sub-sequences. While there are efficient methods for detecting strict repeats or detecting already characterized repeats, there is no software available for detecting approximate repeats in large DNA sequences allowing for weighted substitutions and indels in a coherent statistical framework. Here, we presen...

متن کامل

A perceptual hash function to store and retrieve large scale DNA sequences

This BLOCKIN BLOCKIN paper BLOCKIN BLOCKIN proposes BLOCKIN BLOCKIN a BLOCKIN BLOCKIN novel BLOCKIN BLOCKIN approach BLOCKIN BLOCKIN for BLOCKIN BLOCKIN storing BLOCKIN BLOCKIN and BLOCKIN BLOCKIN retrieving BLOCKIN BLOCKIN massive BLOCKIN BLOCKIN DNA BLOCKIN BLOCKIN sequences. BLOCKIN BLOCKIN The method is based on a perceptual hash function, commonly used to determine the similarity between d...

متن کامل

Searching for Supermaximal Repeats in Large DNA Sequences

We study the problem of finding supermaximal repeats in large DNA sequences. For this, we propose an algorithm called SMR which uses an auxiliary index structure (POL), which is derived from and replaces the suffix tree index STTD64 [1]. The results of our numerous experiments using the 24 human chromosomes data indicate that SMR outperforms the solution provided as part of the Vmatch [2] softw...

متن کامل

Tandem repeats finder: a program to analyze DNA sequences.

A tandem repeat in DNA is two or more contiguous, approximate copies of a pattern of nucleotides. Tandem repeats have been shown to cause human disease, may play a variety of regulatory and evolutionary roles and are important laboratory and analytic tools. Extensive knowledge about pattern size, copy number, mutational history, etc. for tandem repeats has been limited by the inability to easil...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Bioinformatics

سال: 2006

ISSN: 1367-4803,1460-2059

DOI: 10.1093/bioinformatics/btl519